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DISTRIBUTION FREE GOODNESS-OF-FIT TESTS FOR 
LINEAR PROCESSES 1 
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Universidad Carlos III, London School of Economics and Universidad 

Carlos III 

This article proposes a class of goodness-of-fit tests for the auto- 
correlation function of a time series process, including those exhibit- 
ing long-range dependence. Test statistics for composite hypotheses 
are functionals of a (approximated) martingale transformation of the 
Bartlett T p -process with estimated parameters, which converges in 
distribution to the standard Brownian motion under the null hypoth- 
esis. We discuss tests of different natures such as omnibus, directional 
and Portmanteau-type tests. A Monte Carlo study illustrates the per- 
formance of the different tests in practice. 



1. Introduction and statement of the problem. Let / be the spectral 
density function of a second-order stationary time series process {X(t)}t£z 
with mean \i and covariance function 

Cov(X(i),X(0))= r f(X)cos(Xj)d\, j = 0,±l,±2,.... 

J — n 

We shall assume that {X(t)}t£Z admits the Wold representation 

oo oo 

(1) X(t) =(j, + Y^ a(j)e(t - j) with a(0) = 1 and ^ a 2 (j) < oo, 

j=0 j=0 

for some sequence {e{t)} t <^i satisfying E(e(i)) = 0, and E(e(0)e(t)) = a 2 
if t = and = for all t ^ 0. Under (1), the spectral density function of 
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{X(t)} t £i can be factorized as 



a 2 



f(\) = —h(x), xe[o,7r], 

with ^(A):=|E^o«(j>' iA | 2 - 
Let 

(2) H = lh : £logh e (X)dX = O,0ee\, 

where C MP is a compact parameter space. Much of the existing time series 
literature is concerned with parametric estimation and testing, assuming 
that h belongs to 7i, that is, h = hg for some 9q G 0, because the parameter 
#o and the functional form of hg summarize the autocorrelation structure 
of {X(t)} t <=z- Notice that h G TL in (2) guarantees that a(0) = 1 in (1) and 
a 2 = min^ge 2 J n f(X)/hg(X)dX. For our purposes, a 2 can be considered a 
nuisance parameter, as well as the mean \i. 

Classical parameterizations that accommodate alternative models are the 
ARMA, ARFIMA, fractional noise and Bloomfield [4] exponential models 
(see [35] for definitions). For instance, in an ARFIMA specification, Ti con- 
sists of all functions indexed by a parameter vector 9 = (d,rj',S')', where 
066C (-1/2,1/2) xMP 1 x MP 2 , of the form 



(3) MA) 



1 



1 - e 



i\\2d 



E v (e 



AG [0,tt] 



<Me iA ) 

such that and are the moving average and autoregressive polynomials 
of orders pi and P2, respectively, with no common zeros, all lying outside 
the unit circle. 

Before statistical inference on the true value 9q is made, one needs to test 
the hypothesis Ho:h£Tt, which can be equivalently stated as 

(4) H : = - for a11 A G [0. H and some e o e e, 

Ge (vr) 7T 

where 

rA /(A) 



G e (X):=2 f fUrdA, Ag[0,tt] 
Jo MA) 



h e (X) 

Under Hq, Gg is the spectral distribution function of the innovation process 
{e{t)} t £i and G 6o (tt) = a 2 . 

Given a record {X(t)}J =1 and a consistent estimator 9t of 6q under Hq, 
a natural estimator of Gq is defined as Gg T) x(X), where 

09 ^-fTfff »^M. 
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Here T = [T/2], [z] being the integer part of z, and for a generic time series 
process {V(t)} teZ , 

2 



i 



2vrT 



e it\j 



/=i 



j = l,...,T, 



denotes the periodogram of {V(t)}f =1 evaluated at the Fourier frequency 
Xj = 2irj/T for positive integers j. 

The formulation of Hq in (4) suggests use of Bartlett's T p -process as a 
basis for testing Hq. The T p -process is defined as 



,t(A) :=fV2 



Ae [o,7r]. 



Notice that a^.y is scale invariant and that, for j ^Omod(T), Iy{\j) is 
mean invariant, so omission of j = in the definition of Gq,t entails mean 
correction. That is, olqt is independent of both \i and a 2 . 
Under short-range dependence and Hq, we have that 



max_E 

l<j<T 



see [7], Theorem 10.3.1, page 346. So, it is expected that ag 0: T will be asymp- 
totically equivalent to Bartlett's C/p-process for {e(t)}tez, 

~ G° T (\) _ A" 
G° t (tt) ~ n_ 



a° T (X) :=f 1 ' 2 



AG [0,vr], 



with 



2tt 



[txm 



1 3=1 



Ae [0,vr]. 



In fact, under suitable regularity conditions, we shall show below that the 
aforementioned equivalence also holds under long-range dependence. Ob- 
serve that the [/^-process and the T p -process ag 0i x are identical when 
{X(t)}t£Z is a white noise process. 

The C/p-process is useful for testing simple hypotheses when the inno- 
vations {e(t)}J =1 can be easily computed, as is the case when {X(t)}tez is 
an AR model. However, there are many other models of interest whose in- 
novations {e(t)}f =1 cannot be directly computed, for example, Bloomfield's 
exponential model, or difficult to obtain, as in models exhibiting long-range 
dependence, such as ARFIMA models. In those cases, it appears computa- 
tionally much simpler to use oce ,T for testing simple hypotheses. 

The empirical processes and olq ti with fixed 8, are random elements 
in £?[0,7r], the space of right continuous functions on [0,ir] with left-hand 
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side limits, the cadlag space. The functional space D[0,7r] is endowed with 
the Skorohod metric (see, e.g., [3]) and convergence in distribution in the 
corresponding topology will be denoted by "=>" . 

Under suitable regularity conditions on {e(i)} ig ^, it is well known that 

(6) a° T ^Bl 

where B\ is the standardized tied down Brownian motion at n. In terms of 
the standard Brownian motion B on [0, 1], B\ can be represented as 

BI(X)=b(-)-^B(1), A €[0,4 

\7T/ 7T 

Grenander and Rosenblatt [18] proved (6) assuming that {e(t)} t <=.z is a 
sequence of independent and identically distributed (i.i.d.) random vari- 
ables with eight bounded moments. The i.i.d. condition was relaxed by 
Dahlhaus [10], who assumed that {e(t)} t £z behaves as a martingale dif- 
ference, but still assuming eight bounded moments. Recently Kliippelberg 
and Mikosch [27] proved (6) under i.i.d. {e(i)}j g z> but assuming only four 
bounded moments. The i.i.d. requirement is relaxed by the following as- 
sumption: 

Al. The innovation process {e(t)}t<=z satisfies E(e(i) r | Ft-i) = fJ-r with \i T 
constant (fj,i = and Hi = a 2 ) for r = 1, . . . ,4 and all t = 0, ±1, . . . , 
where Tt is the sigma algebra generated by {e(s),s < t}. 

Assumption Al appears to be a minimal requirement to establish a func- 
tional central limit theorem for otj>, due to the quadratic nature of the pe- 
riodogram. 

To establish the asymptotic equivalence between a# 0i T an d o^, we intro- 
duce the following smoothness assumptions on h: 

A2. (a) h is a positive and continuously differentiable function on (0, 7r]; 
(b) \dlogh(X)/dX\ =0(A" 1 ) as A^0+. 

This condition is very general and allows for a possible singularity of h at 
A = 0. It holds for models exhibiting long-range dependence, like 
ARFIMA(p2 5 d,pi) models with d ^ 0, as can easily be checked using (3) 
and that |1 - e iX \ = |2sin(A/2)|. 

Theorem 1. Assuming Al and A2, under Hq, (6) holds and 
sup \ate 0)T (\) - cxtWI = Opi 1 )- 

AG[0,7T] 

We can relax the location of the possible singularity in h at any other 
frequency A ^ 0, as in [23] or, more recently, [14], or even allow for more 
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than one singularity. However, for notational simplicity we have taken the 
singularity, if any, at A = 0. If the location of the singularity is at A 7^ 0, 
then A2 is modified to the following: 

A2'. (a) h is a positive and continuously differentiable function on [0, A ) U 
(AV] ; 

(b) |<91og/i(A)/<9A| = 0(\X - A ]" 1 ) as A^ A . 

We now comment on the results of Theorem 1. This theorem indicates that 
c%,T is asymptotically pivotal. One consequence is that critical regions of 
tests based on a continuous functional (p : D[0,ir] 1— > R can be easily obtained. 
Different functionals (p lead to tests with different power properties. Among 
them are omnibus, directional and/or Portmanteau-type tests. For exam- 
ple, classical functionals which lead to omnibus tests are the Kolmogorov- 
Smirnov (</?(<?) = sup Ag [ 0j7r ] |<?(A)|) and the Cramer-von Mises (p(g) = 
n^ 1 Jq g(X) 2 dX), whereas Portmanteau tests, defined as weighted sums of 
squared estimated autocorrelations of the innovations, and directional tests 
are obtained by choosing an appropriate functional (p\ see Section 3 for de- 
tails. 

On the other hand, in practical situations the parameters 6q are not known 
and, thus, they have to be replaced by some estimate 9t- In this situation, as 
Theorem 2 below shows, the T p -process is no longer asymptotically pivotal 
and, hence, the aforementioned tests are not useful for practical purposes. 
The unknown critical values of functionals of the T p -process with estimated 
parameters can be approximated with the assistance of bootstrap methods. 
This approach has been proposed by Chen and Romano [9] and Hainz and 
Dahlhaus [19] for short-range models using the [/^-process and by Delgado 
and Hidalgo [11], who allow also long-range dependence models using the 
Tp-process. Alternatively, asymptotically distribution free tests can be ob- 
tained by introducing a tuning parameter that must behave in some required 
way as the sample size increases. Among them, the most popular one is the 
Portmanteau test, although it has only been justified for testing short-range 
models. Box and Pierce [5] showed that the partial sum of the squared 
residual autocorrelations of a stationary ARMA process is approximately 
chi-squared distributed assuming that the number of autocorrelations con- 
sidered diverges to infinity with the sample size at an appropriate rate. A 
different approach, in the spirit of Durbin, Knott and Taylor [12] for the 
classical empirical process, is that in Anderson [2], who proposed to ap- 
proximate the critical values of the Cramer-von Mises tests for a stationary 
AR model. The method considers a truncated version of the spectral rep- 
resentation of ag Tt T with estimated orthogonal components. The number of 
estimated orthogonal components must suitably increase with the sample 
size. A similar idea was proposed by Velilla [46] for ARMA models. Finally, 
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another alternative uses the distance between a smooth estimator of the 
spectral density function and its parametric estimator under Hq. This ap- 
proach provides asymptotically distribution free tests for short-range models 
assuming a suitable behavior of the smoothing parameter as the sample size 
diverges; see, for example, Prewitt [34] and Paparoditis [33]. However, the 
final outcome of all these tests depends on the arbitrary choice of the tun- 
ing/smoothing parameters, for which no relevant theory is available. 

This article solves some limitations of existing asymptotically pivotal 
tests, only justified under short-range dependence, by considering an asymp- 
totically pivotal transformation of ote T ,T related to the cusum of recursive 
residuals proposed by Brown, Durbin and Evans [8]. We show that our 
testing procedure is valid under long-range specifications. In the next sec- 
tion we provide regularity conditions for the weak convergence of cng T x and 
its asymptotically distribution free transformation. In Section 3 we discuss 
the behavior of tests of a very different nature — omnibus, directional and 
smooth/Portmanteau — under local alternatives converging to the null at the 
rate T" 1 / 2 . Section 4 reports the results of a small Monte Carlo experiment. 
Some final remarks are placed in Section 5. Section 6 provides lemmata with 
some auxiliary results, which are employed to prove, in Section 7, the main 
results of the paper. 

2. Tests based on a martingale transformation of the T p -process with 
estimated parameters. A popular estimator of 9q is the Whittle estimator 

(7) 6t '■= argminGg i T(7r), 

e>ee 

with Gg t T defined in (5). Let us define 

8 -, T 

<MA) := ^ log MA), S T := ~$>ft,(Wft,(*i) 

j'=i 

and introduce the following assumptions: 

A3, (a) 4>g is a continuously differentiable function on (0, 7r]; (b) \\d(j>g Q (X)/ 
d\\\ = 0(1/ X) as A -> 0+; and for some < 5 < 1 and all A G (0,7r], 
there exists & K < oo such that (c) sup^g. ||g_g || <( 5} ||<^e(A)|| < K\ logA|; 
(d) 

1 hg (A) 



SU P lift a 112 

{6: ||0-0 o ||<<5/2} IF ~ ^0|| 



MA) 



1 + <4(A)(0-0 O ) 



< 5 log 2 A; 



X s 



and (e) T,q := tt Jq </ ) 6» (A)</ > e (A) dX is positive definite. 



These assumptions are standard when analyzing the asymptotic distribu- 
tion of the Whittle estimator 9t and they are satisfied for all parametric 
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linear processes used in practice. Standard ARMA models satisfy a stronger 
condition, replacing the upper bounds in A3(c) and (d) by a constant inde- 
pendent of A. It can easily be shown that A3 is satisfied for ARFIMA models. 
Note that A3(e) and Lemma 1 in Section 6 imply that St is positive definite 
for T large enough. 

A4. The estimator in (7) satisfies the asymptotic linearization 

(8) f 1 / 2 (e T -e ) = S T - 1 l % cj> eo {X)ae 0>T {dX)+o p {l). 

Jo 

The expansion (8), in assumption A4, is satisfied under A1-A3 and ad- 
ditional standard identification conditions; see [15, 20] or [45] for a later 
reference. 

Define 

aoo (A) := Bl(X) - (± jf * <4 (A) dXj S,- 1 jT fa (X)Bl(dX). 



Theorem 2. Under Hq and assuming A1-A4, uniformly in X G[0,7r]: 

[TX/n 



(a) a 6T>T (X) = a T (X)-(l £ ^ (A,) j^ 1 J\g (X)a T (dX)+o p (l); 

(b) ag T)T ^a 00 . 



Theorem 2 indicates that the asymptotic critical values of tests based on 
ag Ti T cannot be tabulated. However, we can use a transformation of q.q T} t 
that converges in distribution to the standard Brownian motion. To this 
end, it is of interest to realize that Theorem 2(a) provides an asymptotic 
representation of ag T) x as a scaled cumulative sum (cusum) of the least 
squares residuals in an artificial regression model. For that purpose, observe 
that by (2), and using the fact that 4>q is integrable [A3(c)], 

(9) <f> 9o (X)dX = 0. 

Jo 

Now, because Lemma 1 in Section 6 with £(A) = (fie (X) and (9) imply 

that || J2k=i "MjO^OII = O(logT), the uniform asymptotic expansion in The- 
orem 2(a) indicates that 



sup 

Ag[0,tt] 



2vr 1 [TXM 



ae -' T{X) - G^)Wf2 g UT{J) 
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where 



u T (j)=I £ (\ j )-j , go (\ j ) 



k=l 



-1 T 
k=l 

3 = 1,..., f, 



are the least squares residuals in an artificial regression model with de- 
pendent variable I e (\j) and a vector of explanatory variables 7e (Aj) := 
(1,4>'q (Aj))'. This fact suggests employing the cusum of recursive residuals 
for constructing asymptotically pivotal tests, as were proposed by Brown, 
Durbin and Evans [8]; see also [39]. 
Let us define 

1 f 

1 k=j+i 

and assume the following: 

A5. Aq 0i t{T) is nonsingular for T = T — p — 1. 

The (scaled) cusum of forward recursive least squares residuals is defined 

as 

where 

e T (j) := IsiXj) - 7^ (A,)6t(j), 3 = 1, ■ • ■ ,T, 
are the forward least squares residuals and 

b T (j):=A^] T (j)l fl 7e (A fc )/ £ (A fc ). 

1 k=j+l 

It is worth observing that the motivation to employ only the first T Fourier 
frequencies to compute the recursive residuals is due to the singularity of 
A e , T (j) for all j>T. 

The empirical process (3j> can be written as a linear transformation of a^, 

f3° T {X) = Ce 0) ToP T (X), Ag[0,tt], 
where, for any function g € D[0,n], 

^,T5(A) = 5 ~A - ~ £ Yfl(Aj)^O') / 7fl(A)^(dA). 
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The transformation £e> ,T has the limiting version £°, defined as 
C°g(\)=g(\)-± f X j' eo (X)Aj^(X) C je (X)g(dX) dX, 



7T JO 

where 

AooW ■= [\e (XH Wd~X. 

J A 

Notice that /^ a oo is the martingale innovation of see [25]. 

This type of martingale transformation has been used by Khmaladze [25] 
and Aki [1] in the standard goodness-of-fit testing problem, by Nikabadze 
and Stute [32] for goodness-of-fit of distribution functions under random cen- 
sorship, by Stute, Thies and Zhu [42], Koul and Stute [28, 29] and Khmaladze 
and Koul [26] for dynamic regression models, and by Stute and Zhu [43] for 
generalized linear models. 

Henceforth, B W (X) :=B(X/n) for AG [0,vr]. 

Theorem 3. Under Hq and assuming A1-A5, 

Because cannot be computed in practice, as it depends on 8q, it is 
suggested one use Pe T ,T, where 

0O,t(X) ■= Cg t TCte,T(X) 

2n 1 [7FXM 



J2 e e,r(j), X £ [0, 7r], 



and 

ng{Xj) 

are the forward recursive residuals in the linear projection of Ix(Xj) /he(Xj) 
on "fe(Xj), and where 

In order to establish the asymptotic equivalence between fij, and Pe T ,T, 
we also need some extra smoothness assumptions on the model under the 
null. 
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A6. For some < 5 < 1 and all A € (0, n] , there exists a constant K < oo 
such that 

SU P . im 1 p UeW ~ <M,(A) - fo o {X){0 - O )|| < K\ log A|, 



{9: \\0-0 \\<8} 



and cj)g satisfies A3(a)-(c). 

This assumption holds for all models used in practice, such as ARFIMA 
in (3), Bloomfield's exponential model and the fractional noise models men- 
tioned before. In fact, they satisfy even the stronger condition with K\ log A| 
replaced by K. 

Theorem 4. Under Hq and assuming A1-A6, 

sup |/3 0T)T (A)-^(A)|=o p (l). 

AG[0,7r] 

Theorem 4 holds true, mutatis mutandis, with 9t replaced by any T 1 / 2 - 
consistent estimator. Also, from a computational point of view, it is worth 
observing that 

An T (j) = An T (j +1) — ; 

and 



be,rU) = b e,rU + 1) + A e rUhefrj 



hg(Xj) 



ie(^)h,T{j + l) 



see [8] for similar arguments. 

Alternatively to (3q Ti t, we could have considered the cusum of backward 
recursive residuals, that is, 

2vr 1 [¥XH 

where 

ee,TU) ■■= T^IT " TeMhrU), j = p + 1, . . . ,T, 

b e>T (j):=A^ T (j)ij2^k)^^ and A e>T (j) := I £ le ^kH^k). 
In this case, we can take advantage of the computational formulae 

AgUj + l)=AgUj) 



Ag^(jhe(X j+ ih e (X j+ i)A 9 Uj) 



T + %(X j+1 )A-Uj) 7e (X J+1 ) 
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and 



Ix(X 



■3+1 j 



j' e (X j+1 )b e , T (j) 



9 ,tU + 1) = b e ,T(J) + A eUj + 1 )7e(A i+ i) 

lho{Aj +lj 

This formulation may be useful in small samples when we suspect that the 
main discrepancy between the null and the alternative is near ir. However, 
from Theorems 3 and 4, it is easily seen that the empirical processes (3e T ,T 
and Po Tj t have the same asymptotic behavior. 

Let ip:D[0,n] — > K be a continuous functional. Under Hq and the condi- 
tions in Theorem 4, 



as a consequence of the continuous mapping theorem. For instance, 



Kr, 



sup 

3=l,-,T 



Pe T ,T\ ■=- 



■ sup \B n (X)\ = sup |-B(u)|, 
Ae[0,7r] we[o,i] 



1 



T 



3=1 



J7T 

T 



4i 



Bl(X)d\= / B 2 {u)dw. 



The above limiting distributions are tabulated; see, for example, [40], pages 
34 and 748. 



3. Local alternatives: omnibus, directional and Portmanteau tests. In 

this section we shall show that tests based on f3g Tt T are able to detect local 
alternatives of the type 

Hvr : h(\) = hg (A) (l + r J^l(X) + i*r(A)) , 

As [0, 7r] and for some 9o 

where l(X) dX = 0, l(X) satisfies the same properties as (j>g in A3(a)-(c), r 
is a constant, possibly unknown, and for some finite To, |st(')I i s integrable 
for all T >Tq. Let us consider some examples. 

Example 1. If we wish to study departures from the white noise hy- 
pothesis in the direction of fractional alternatives, we have 

h(X) 1 x r . 

hooW |2sin(A/2)P<i/^' 

for some d ^ 0. By a simple Taylor expansion up to the second term, 

l(X) = -2 log |2sin(A/2)| and r = d, 

respectively, with the remainder function st being such that, for some < 
e < 1, |st(A)| < i^|A| _e for all large T and some K < oo. 
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Example 2. If we consider departures in the direction of MA(1) alter- 
natives, we obtain 

MA) 1 „ , 1 2 



1-r]^— 2cos(A) + ^t? 2 , Ae[0,7r]. 



h 6o (X) 'TV2 T 

Thus, r = 77, Z(A) = — 2cos(A) and sy(A) = r/ 2 . 

Example 3. If we consider departures in the direction of AR(1) alter- 
natives, then 



h(X) 



1-5 J— 2cos(A) + i(5 2 



n -1 



AG [0,vr]. 



he a {X) 

Thus, t = 5 and /(A) = 2cos(A) with |sr(A)| < K, for all large T and some 
K < 00. 

For A G [0, 7r] , let us define 

(10) L(A) := ~ | q A {/(A) - 7 ^(A)A^(A)i J* jg (A)Z(A) dA j dX 
and 

M{X):=B 7r {X) + T-L(X), Ae[0,vr]. 
We have the following theorem. 

Theorem 5. Assuming the same assumptions as in Theorem 4, under 
Hit, 

Pe T>T M. 

Using the fact that M and B n are identically distributed, except for the 
deterministic shift r • L, and taking into account that 2 1//2 sin((j — 1/2) A) 
and l/(j — l/2) 2 7r 2 are the eigenfunctions and eigenvalues in the Kac-Siegert 
representation of B n [24], the orthogonal components of M, 

m{j):=2 1 ' 2 {j-\) f sin((j-±)A)M(A)dA, j = 1,2,..., 



are independently distributed normal random variables with mean r • £(j) 
and variance 1, where 

£(j) = 2 1 / 2 (j-l) r S m((j-l)X)L(X)dX, j = 1,2,.... 

w 

Using the (asymptotically) orthogonal components of Pe T ,T, 

m T (j) = 2 1 / 2 (j-i) r S m((j-l)X)p eT , T (X)dX, 3 = 1,2,..., 
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we obtain the spectral representation, 

a m oi/2^ ^T(i)sin((j-l/2)A) 

^ t ,t(A) = 2 2^ — _ ^ , AG[0,vrj. 

By Theorem 5 and the continuous mapping theorem, finitely many of the 
mr(j)'s converge in distribution to the corresponding m(j)'s under H\t- 
Using Parseval's theorem, 

Using similar arguments to those in [13] in the context of the standard 
empirical process with estimated parameters, tests based on 

n 
3=1 

with a reasonable choice of n > 1, will lead to gains in power, compared to 
Ct, in the direction of alternatives with significant autocorrelations at high 
lags. These Portmanteau tests are related to Neyman's [31] smooth tests, 
a compromise between omnibus and directional tests, and for each n > 1, 
under Hit we have that 




That is, tests based on W n< T are asymptotically pivotal under Hq (t = 0) 
for each choice of n, and more importantly, they are able to detect local 
alternatives converging to the null at the parametric rate T -1 / 2 , provided 
that £(j) ^ for some j = 1, . . . , n. The latter is in contrast with the classical 
Portmanteau tests based on 

TIT 

(ii) Q« T ,r:=£( Tl/2 Mi)) 2 , 

3=1 

where Pt{j) is some estimate of the jth autocorrelation of the residuals. It 
has been shown that Q nr ,T is approximately distributed as a Xn T -p under 
Hq specifying a short-range model and assuming that ut diverges as T — > oo. 
On the other hand, the resulting test is able to detect alternatives converging 
to the null at the rate n^/ 4 T -1 / 2 (see, e.g., [21]), which is slower than T -1 / 2 . 
In practice, it is recommended that one use the discrete version 

n 
3=1 
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of W n ,Ti with 

On the other hand, optimal tests of Hq in the direction Hit can be 
constructed applying results in [16] (see also [17] and references therein), 
as was suggested by Stute [41] in the context of goodness-of-fit testing of a 
regression function. Asymptotically, testing for Hq in the direction of Hit 
is equivalent to testing Hq '■^■( m (j)) = for all j > 1, against H\ :E(m(j)) = 
r for all j ' > 1 with L known, but maybe with unknown r . Under Ho, the 
distribution of {m(J)}j>i is completely specified, as it is also under Hi when 
the parameter r is known. Then the likelihood-ratio for a finite-dimensional 
set (m(l), . . . , m(n)) is 




Grenander [16] showed that A n — > p as n — > oo, and that the most pow- 
erful test at significance level a has a critical region of the form {Aoo > k}, 
with Pq{A oq > k} = a if Y^jLi^ 2 {j) < 00 ■ The latter condition is satisfied in 
our context by Parseval's theorem and A3(c) because I is a square integrable 
function. 
Define 

Then under Hq, i/j = N(0, 1), and in view of (12), -0 forms a basis to obtain 
optimal critical regions. When the sign of r is known, the critical region 
of the uniformly most powerful test at significance level a is {ip > zi- a } 
when r > and {ip < — zi- a } when r < 0, where z v is the v quantile of the 
standard normal. Also, when the sign of r is unknown, the most powerful un- 
biased test at significance level a has critical region given by {\ip\ > 24-0/2} • 
These arguments suggest an (asymptotically) optimal Neyman-Pearson 
test in the direction of Hit based on the first n orthogonal components of 
Pd T ,T, using the test statistic 

- _ £?=il(j)-m r (j) 

Schoenfeld [38] proposes the same type of statistic in the standard goodness- 
of-fit testing context. Under Hq and the assumptions in previous sections, 
we have that 

$n t T —* A(0, 1) as T — > 00 for each fixed n. 
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Also, arguing as in Schoenfeld's [38] Theorem 3, the convergence in dis- 
tribution of ipn T ,T when ut increases with T can be shown. Approximately 
optimal tests for Hq in the direction of H\t reject Hq at significance level a 
when |VVi t ,t| > Z\-al2 if r h as unknown sign, VVi T .T > ^i-q when r > and 
^n T ,T < — Zi-a when r < 0. 

4. Some Monte Carlo experiments. A small Monte Carlo study was car- 
ried out to investigate the finite sample performance of the different tests. 
To that end, we considered the AR(1), MA(1) and ARFIMA(0, do, 0) models 

(13) (l-S L)X(t)=e(t), 

(14) X(t) = (1 - m L)e(t), 

(15) (l-L) d °X(t) = e(t), 

respectively, where the parameter 9q equals So, rjo and do for the different 
models and L is the lag operator. The innovations {e(t)}f =1 are i.i.d. J\f(0, 1), 
and the sample sizes used are T = 200 and 500 with different values of 
the parameters So, rjo and do- For models (13) and (14), we considered So, 
m = -0.8,-0.5,0.0,0.5,0.8, whereas for model (15) d = 0.0, 0.2, 0.4. The 
ARFIMA model was simulated using an algorithm by Hosking [22]. 

For the three models and all values of 0o, we computed the proportion 
of rejections in 50,000 generated samples for both sample sizes. Whittle 
estimates are obtained according to (7). For each of the models considered 
4>e is given by 

AR(1), e = S:^) = i-Ml-Se^' 



85 bl 1 l-25cosA + 5 2 ' 

MA(1), = 7 ? :^(A) = ^-log|l- 77 e iA|2 -' 1 



drj 1 — 2rj cos A + rf- ' 

ARFIMA(0, d, 0), 9 = d : 6AX) = ^- log 1 1 - e iX \ ~ 2d = -2 log 12 sin(A/2) I . 

od 

We also report, as a benchmark, the proportion of rejections using 
^:= 1 r«LT(A)dA = T£^#, 

TT JO TT J 

j = l J 

which is suitable for testing simple hypotheses. In addition, for the sake of 
comparison, we provide the results for the Box and Pierce [5] test statistic 
(11) using several values of ht increasing with T, where pr(j), j > 1, are 
the sample autocorrelations of the residuals {e(t)}f =1 . Specifically, for the 
AR(1) model, 

e(t) = (l-S T L)X{t), 
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with X(t) = for t < 0; for the MA(1) model, 

e(t) = X(t)-rtri(t-l), 

with e(0) = 0; and for the ARFIMA(0, d, 0) model, 

t-i 

i=o 

where d) are the coefficients in the formal expansion 

oo 
3=0 

with 

The standardized values of Q nT ,T, {Qn T ,T — n T)l V^ n T are compared with 
the 5% critical value of the standard normal (see Hong [21]) instead of the 
usual X(„ T _i) approximation correcting by the loss of degrees of freedom 
due to parameter estimation, which is justified under Gaussianity. The two 
approximations provide a similar proportion of rejections. We also tried the 
weighting suggested by Ljung and Box [30], which produced very similar 
results. 

First we analyze the size accuracy of the Cramer-von Mises test based 
on Po t t- The empirical sizes of the tests based on Ct, reported in Table 1, 
are reasonably close to the nominal ones. The asymptotic approximation 
improves noticeably when the sample size increases from T = 200 to T = 500, 
this improvement being uniform for all the models, although the empirical 
size is smaller than the nominal level. Tests based on Q nT ,T have serious 
size distortions for the smaller sample size and large values of \rj\ in the 
MA(1) model, since Whittle estimates can be quite biased in these cases. The 
empirical size of tests based on Q UT) t depends substantially on the number of 
autocorrelations used. In addition, for the larger choices of ut implemented, 
Qn T ,T over-rejects Hq. The usual recommendation nx = o(T 1 / 2 ) also seems 
reasonable here, in terms of size accuracy. 

Next we study the power performance of the tests. To this end, we report 
first, in Table 2, the proportion of rejections under the alternative hypothesis 
for different nonnested specifications with the model specified under the null. 
We cannot conclude that one test is clearly superior to the others in any 
of the four cases analyzed. As expected, the power of the Portmanteau test 
decreases as increases. In view of Tables 1 and 2, we can conclude that 
a choice of large ut, around T -1 / 2 , produces reasonable size accuracy, but 
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Table 1 

Empirical size of omnibus and Portmanteau tests at 5% significance 



T = 200 T = 500 



Ct Ct Qs,T Qg,T QlO,T Q20,T Ct Ct Qs,T Qe,T Ql5,T C?35,T 



5 ,H :AR{\) 

-0.8 4.92 4.69 3.34 3.72 3.91 3.61 5.07 5.17 3.56 3.87 4.35 3.97 

-0.5 4.38 4.96 2.80 3.38 3.60 3.41 4.96 5.16 3.12 3.75 4.17 3.82 

0.0 4.07 4.96 2.66 3.35 3.45 3.37 4.62 5.10 3.00 3.63 4.11 3.82 

0.5 3.59 4.95 2.67 3.33 3.57 3.40 4.50 5.04 2.97 3.82 4.17 3.80 

0.8 3.08 4.92 2.89 3.44 3.73 3.54 4.27 5.11 3.33 3.77 4.32 3.88 

770, #o :MA(1) 

-0.8 4.25 8.37 4.32 4.54 4.42 3.95 4.89 6.67 4.13 4.39 4.56 4.07 

-0.5 4.16 5.06 2.83 3.41 3.65 3.38 4.89 5.18 3.13 3.76 4.15 3.83 

0.0 4.08 4.96 2.51 3.26 3.46 3.32 4.62 5.10 2.94 3.61 4.05 3.82 

0.5 3.60 5.08 2.65 3.30 3.55 3.41 4.49 5.15 2.96 3.77 4.13 3.82 

0.8 3.89 7.72 15.33 15.30 15.33 15.05 4.63 6.42 8.03 8.44 8.68 8.17 

do,H :I(d) 

0.0 3.53 4.96 2.76 3.40 3.68 3.47 4.48 5.10 3.13 3.90 4.29 3.83 

0.2 3.54 4.95 2.76 3.39 3.63 3.46 4.54 5.15 3.14 3.89 4.27 3.81 

0.4 3.58 5.21 2.79 3.39 3.59 3.44 4.58 5.37 3.14 3.88 4.27 3.80 



such a choice is not the best possible one in order to maximize the power. 
The test based on Ct is fairly powerful compared to the Portmanteau test 
for all cases considered, and it works remarkably well when testing an AR(1) 
in the direction of an MA(1) alternative. 

Finally, we analyze the power of the different tests when testing an AR(1) 
specification in the direction of local ARFIMA(1, d, 0) with d = t/T 1 ^ 2 , and 
in the direction of local ARMA(1, 1) alternatives with moving average pa- 
rameter T] = t/T 1 / 2 , for different values of r. The proportion of rejections for 
these designs is reported in Tables 3 and 4. We also consider tests based on 
the test statistics W n% T and ip n .T (one-sided and two-sided, ip^T an d Wu,t\ 
resp.), choosing n = 3 and 6, which has been recommended by Stute, Thies 
and Zhu [42] for a different goodness-of-fit test problem. Of course, tests 
based on the first n (asymptotic) orthogonal components of (3e T ,T are sensi- 
tive to the choice of n, as also happens with tests based on the n (asymptotic) 
orthogonal components of ag T: T (the estimated autocorrelations of the in- 
novations) in Portmanteau tests. The omnibus test based on Ct still works 
fairly well compared to the others, including the optimal and smooth tests. 
The directional tests are the most powerful in the directions for which they 
are designed, and the tests based on W n .T and Qn T ,T work very similarly, 
though W n T exhibits a better size precision for the choices of n considered. 
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Table 2 

Empirical power of omnibus and Portmanteau tests at 5% significance 



T = 200 T = 500 





Ct 




Qe 


,T 


QlO,T 


Q20,T 


Ct 


Q-3 


,T 


Qe 


,i 


Ql5,T 


Q35,T 


V, H 
-0.8 


:AR(1), #i:MA(l) 
100.00 99.97 99 


95 


99 


.25 


92 


.34 


100 


.00 


100 


.00 


100 


00 


100 


.00 


100. 


.00 


-0.5 


80.82 


70.16 


55. 


53 


44 


38 
o o 


O 1 




QQ 


8 1 

O-l 


99 


.23 


97. 


54 


88 
oo 




R8 




0.2 


7.12 


5.04 


4 


.98 


4 


86 


4 


.34 


12 


16 


8 


.31 


7 


35 


6 


.27 


5 


.21 


0.5 


70.82 


72.03 


57. 


50 


46 


.06 


32 


.15 


98 


.59 


99 


.32 


97. 


83 


89 


.19 


69 


.29 


0.8 


99.56 


99.99 


99. 


95 


99 


.30 


92 


.76 


100 


00 


100 


00 


100. 


00 


100 


.00 


100 


.00 


5, Ho 
-0.8 


:MA(1), 
100.00 


#i:AR(l) 
100.00 100. 


00 


100 


.00 


99 


.99 


100 


.00 


100 


.00 


100. 


00 


100 


.00 


100. 


.00 


-0.5 


84.36 


77.15 


66. 


51 


57 


.37 


44 


.02 


99 


.73 


99 


47 


98. 


45 


94 


.26 


82, 


.89 


0.2 


7.16 


3.71 


3. 


99 


3 


.94 


3 


.63 


12 


.04 


6 


.65 


6. 


42 


5 


.73 


4 


.80 


0.5 


77.08 


74.86 


64. 


04 


54 


.79 


31 


.78 


99 


.19 


99 


41 


98 


35 


93 


.77 


82. 


.04 


0.8 


100.00 


100.00 


100. 


00 


100 


.00 


99 


.97 


100 


.00 


100 


.00 


100. 


00 


100 


.00 


100 


.00 


5, Ho 
0.2 


11.34 


:AR(1) 
12.84 


13 


00 


11 


.27 


13 


.13 


34 


92 


33 


.35 


33. 


01 


23 


.98 


15 


.71 


0.5 


26.81 


34.11 


41 


17 


35 


.55 


24 


.94 


75 


.29 


81 


.36 


87. 


.81 


80 


.73 


58 


.52 


0.8 


9.82 


12.86 


21 


01 


21 


.32 


15 


.41 


33 


.21 


38 


.74 


57. 


53 


61 


.63 


39 


.15 


d, H 
0.1 


:AR(1), H x :\{d) 
8.22 4.98 


5. 


66 


5 


11 


4 


.83 


16 


.79 


12 


.07 


14. 


09 


12 


.34 


9 


.10 


0.2 


19.90 


13.74 


16. 


20 


15 


.23 


11 


.81 


51 


.77 


45 


04 


53 


29 


47 


.54 


36 


.11 


0.3 


36.03 


25.92 


32. 


00 


30 


.50 


24 


.35 


82 


.80 


74 


84 


85. 


12 


81 


44 


69 


.62 


0.1 


48.83 


34.86 


43. 


.78 


43 


.31 


35 


.48 


94 


40 


87 


.30 


95. 


56 


94 


.31 


87 


.38 



5. Final remarks. Our results can be extended to goodness-of-fit tests of 
models that can accommodate simultaneously stationary and nonstationary 
time series. For instance, if the increments Y(t) := (1 — L)X(t), i = 0, ±1, . . . , 
are second order stationary with zero mean and spectral density g such that 

lim |A| 2(d-1) s(A) = G > for some d G [0.5, 1.5), 
we can define the pseudo-spectral density function of {X(t)} t t=z, /, as 

Thus, when d 7^ 1, g has a singularity at A = 0, as happens with many long- 
range dependent time series (cf. A2). If {X(t)}t£z is stationary, / becomes 
the standard spectral density function. 

If either {Y(t)}t£z or {X(t)} t t=z satisfies Wold's decomposition, / admits 
the factorization 

/(A) = 
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Table 3 

Empirical size and power under local alternatives at 5% significance 



H : AR(1), H x : ARFIMA(1, d = t/T 1/2 , 0) 













|V>3,t| 


\ipe,T\ 










T 


P 


Ct 




We,T 








Qe,T 














T = 200 










o 


0.0 


4.07 


3.19 


2.59 


4.70 


4.81 


4.48 


5.12 


2.66 


3.35 




0.5 


3.59 


2.98 


2.32 


3.79 


4.24 


3.62 


3.99 


2.67 


3.33 




0.8 


3.08 


2.52 


1.94 


3.94 


3.10 


3.75 


4.02 


2.89 


3.44 


1 


0.0 


6.26 


5.40 


4.37 


8.39 


11.13 


13.44 


16.63 


3.68 


4.25 




0.5 


3.57 


2.90 


2.26 


3.45 


4.19 


4.19 


5.64 


2.73 


3.37 




0.8 


3.01 


2.25 


1.66 


4.10 


4.52 


7.80 


8.53 


3.87 


4.41 


2 


0.0 


12.19 


12.04 


10.53 


19.93 


26.15 


28.94 


35.10 


7.80 


9.13 




0.5 


3.44 


2.91 


2.36 


3.47 


4.15 


4.25 


6.27 


2.91 


3.58 




0.8 


4.84 


3.16 


2.19 


9.17 


10.33 


16.59 


17.98 


8.45 


7.58 


3 


0.0 


21.92 


23.63 


21.27 


35.77 


44.37 


47.20 


54.61 


15.17 


18.02 




0.-") 


3.26 


2.74 


2.39 


3.65 


4.43 


4.99 


6.48 


3.27 


3.92 




0.8 


9.13 


6.61 


4.10 


20.13 


22.90 


31.95 


35.14 


21.18 


16.12 


4 


0.0 


33.38 


27.13 


24.15 


50.40 


59.39 


62.18 


69.12 


23.88 


29.88 




0.5 


3.41 


2.47 


2.38 


4.09 


4.75 


6.80 


7.61 


4.32 


4.67 




0.8 


17.48 


14.65 


9.09 


38.10 


43.37 


53.13 


57.56 


46.00 


33.97 














T = 500 













0.0 


4.62 


4.22 


3.66 


4.81 


4.78 


4.57 


5.06 


3.00 


3.63 




0.5 


4.50 


3.99 


3.40 


4.26 


4.58 


4.27 


4.43 


2.97 


3.82 




0.8 


4.27 


3.56 


3.09 


3.90 


3.85 


4.63 


3.63 


3.33 


3.77 


1 


0.0 


6.93 


7.03 


6.29 


9.35 


11.62 


14.63 


17.54 


4.37 


5.13 




0.5 


4.58 


4.42 


4.08 


4.85 


5.35 


58.30 


7.43 


3.02 


3.93 




0.8 


4.74 


4.13 


3.47 


5.72 


5.90 


9.61 


9.83 


4.12 


4.64 


2 


0.0 


14.22 


15.51 


14.23 


23.43 


29.37 


33.47 


39.37 


10.03 


11.60 




0.5 


4.69 


4.72 


4.67 


4.83 


6.49 


6.37 


10.18 


3.08 


4.21 




0.8 


7.36 


6.13 


4.73 


11.57 


12.08 


19.11 


19.81 


7.27 


7.38 


3 


0.0 


26.86 


31.03 


29.55 


44.70 


53.35 


56.44 


63.59 


21.28 


24.91 




0.5 


4.65 


5.04 


5.48 


4.71 


7.14 


5.44 


11.31 


3.30 


4.60 




0.8 


13.56 


11.62 


8.18 


23.46 


24.65 


34.56 


35.78 


15.23 


13.51 


4 


0.0 


43.62 


51.19 


49.81 


66.34 


74.28 


75.93 


81.84 


37.13 


43.93 




0.5 


4.65 


5.18 


6.35 


5.05 


7.03 


5.09 


10.80 


3.81 


5.09 




0.8 


24.44 


23.10 


16.17 


42.07 


44.05 


54.86 


56.23 


31.28 


25.74 



IV'k.tI denotes two-sided tests, whereas V>„ T are one-sided (right-hand side) tests. 



where h satisfies A2. Thus, given a parametric family TC, for example, the 
ARFIMA specification given in (3), a T p -process for testing that h E TC is 



A G [0, 7r] 
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Table 4 

Empirical size and power under local alternatives at 5% significance 



H : AR(1), H 1 : ARMA(1, 1), V = t/T 1/2 



T 


P 


/■ ' 


VT3,T 


We,T 




LA 1 


V>3,T 




Q3,T 


Q6,T 














T = 200 













0.0 


4.13 


3.09 


3.58 


3.98 


4.39 


4.18 


4.39 


2.65 


3.36 




0.5 


3.62 


2.80 


2.22 


3.68 


4.04 


3.93 


4.14 


2.67 


3.31 




0.8 


3.06 


2.38 


1.86 


3.00 


3.21 


3.45 


3.64 


2.93 


3.46 


1 


0.0 


4.22 


3.10 


2.58 


3.88 


4.23 


3.74 


3.93 


2.76 


3.40 




0.5 


5.52 


4.08 


2.90 


5.51 


5.76 


8.86 


9.20 


3.08 


3.61 




0.8 


7.81 


5.63 


3.66 


7.77 


7.98 


13.13 


13.62 


5.47 


5.05 


2 


0.0 


5.01 


3.50 


2.79 


3.77 


4.06 


3.36 


3.46 


3.45 


3.82 




0.5 


8.53 


6.10 


4.02 


8.58 


9.06 


14.33 


14.61 


4.51 


4.56 




0.8 


18.07 


13.73 


8.53 


20.63 


21.26 


30.93 


31.41 


12.52 


10.66 


3 


0.0 


7.79 


5.04 


3.76 


4.62 


4.92 


6.00 


6.06 


5.60 


5.32 




0.5 


10.64 


7.80 


5.16 


10.84 


11.25 


17.39 


17.87 


5.76 


5.41 




0.8 


32.10 


27.17 


17.65 


37.68 


38.18 


50.25 


50.49 


23.84 


20.09 


4 


0.0 


14.60 


9.51 


6.65 


10.86 


11.01 


16.70 


16.78 


11.03 


8.99 




0.5 


10.67 


8.16 


5.42 


10.65 


11.01 


17.11 


17.57 


5.93 


5.56 




0.8 


45.29 


42.62 


29.55 


52.48 


52.79 
T = 500 


64.96 


64.97 


36.18 


31.63 





0.0 


4.70 


4.43 


3.86 


4.66 


5.68 


4.52 


4.62 


2.99 


3.64 




0.5 


4.50 


4.23 


3.70 


4.53 


4.55 


4.50 


4.52 


2.99 


3.80 




0.8 


4.39 


3.94 


3.40 


4.22 


4.26 


4.37 


4.38 


3.34 


3.78 


1 


0.0 


4.74 


4.37 


3.83 


4.70 


4.75 


4.31 


4.35 


3.02 


3.70 




0.5 


6.68 


5.72 


4.73 


6.71 


6.61 


10.25 


10.36 


3.75 


4.32 




0.8 


9.56 


8.06 


6.00 


10.03 


10.08 


16.20 


16.28 


6.26 


5.82 


2 


0.0 


5.00 


4.47 


3.90 


4.76 


4.87 


3.61 


3.62 


3.34 


3.90 




0.5 


11.06 


8.94 


6.81 


11.48 


11.43 


18.23 


18.17 


6.06 


5.88 




0.8 


23.21 


19.66 


13.89 


26.87 


26.88 


38.01 


37.99 


15.66 


13.35 


3 


0.0 


6.31 


5.17 


4.38 


4.95 


5.03 


3.19 


3.18 


4.25 


4.55 




0.5 


16.44 


13.17 


9.58 


17.26 


17.24 


26.26 


26.03 


9.45 


8.39 




0.8 


42.78 


38.92 


28.30 


50.11 


49.91 


62.36 


62.42 


32.23 


27.37 


4 


0.0 


9.48 


6.98 


5.57 


5.09 


5.16 


4.09 


4.07 


6.40 


5.98 




0.5 


21.08 


17.22 


12.42 


22.10 


21.95 


32.15 


31.99 


12.84 


10.89 




0.8 


62.44 


60.69 


47.41 


70.99 


70.86 


80.69 


80.67 


52.01 


46.42 



|^n,:r| denotes two-sided tests, whereas ip^T are one-sided (right-hand side) tests. 



where G^ T is analogous to Gq^t, but using the tapered periodogram, for 
example, 
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Here 6t = argmin^e G^ T {ir) is the Whittle estimator proposed by Velasco 
and Robinson [45], which admits a similar asymptotic first order expansion 
as in (8), and where w is a taper function, for example, the full cosine taper 

If (2irt\ 

; I 



w{t) = -[l-cos^— J J, t = l,...,T. 

If the full cosine taper is used, because of its desirable asymptotic properties 
(see [44]), it is recommended in practice to base our tests on the empirical 
process Pq t , where 

/ p2 n 1/2 o m 

^T(A m ):=(^j ^r^yE«j)> m=l,...,T, 



with 



n e {Aj) 



and 



Under appropriate regularity conditions, it can be proved using tools in [44] 
and [45] that ^, |T =^J5 ff . 

Finally, the methodology can be extended to test the correlation struc- 
ture of the innovations of regression models (e.g., distributed-lags models) 
using the martingale part of the t/ p -process based on the residuals. When 
TK(z(t)u(s)) =0 for all t, s, where {z(t)}f =1 are the regressors and {u(t)}f =1 
the error term, the residual t/ p -process is asymptotically equivalent to the 
Up-process based on the true innovations, and there is no need to use tests 
based on the martingale part of the C/p-process. When K(z(t)u(t — s)) ^ 
for some s > 0, the first-order expansion of the residual C/p-process depends 
on the cross-spectrum of the innovations and regressors. However, it seems 
possible to apply the results in this paper to implement tests based on the 
(approximate) martingale part of this J7 p -process with estimated parame- 
ters. 



6. Lemmas. This section provides a series of lemmas which will be used 
in the proofs of the main results. Some of them can be of independent 
interest. Henceforth, denotes the kth element of a p x 1 vector z and K 
a finite positive constant. Also, we shall abbreviate g(Xj) by gj for a generic 
function g(\). 
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Lemma 1. Let ( : (0, n] —>W be a function such that ||C(A)|| < K\ log \\ l , 
£>1, and \\d((\)/d\\\ < KX^l logA| f_1 for all A > 0. Then, asT^oo, 



(16) 



sup 

ag[o,tt; 



[TA/tt 



d.r 



<K 



(logry 



Proof. The left-hand side of (16) is bounded by 





1 f X 




sup 


— / c( x ) dx 


+ sup 


Ae[0,7r/?) 


vr JO 


Ae[7r/f,7r] 



(17) 

The first term of (17) is bounded by 
1 r*/T 



C(x) <ix 



||C(x)||dx<X 



* /T | logxr<fc <^(^ 



Next, by the triangle inequality, the second term of (17) is bounded by 

i r*/T 



(18) 



sup 

A€[tt/T>] 



l l r^' 

T n Jo 



dx 



sup 

Ae[vr/f,7r] 



7T 



i=i 



jtt/T 



||Ci-C(aOI|dz. 



The first term of (18) is bounded by KT 1 (logT)^ since ||C(x)|| < -^1 logx| . 
Next, by the mean value theorem, the second term of (18) is bounded by 



r~L r (j+l)n/T 1 



A, 



J7T 
f 



lo, 



^ if (log T / 



^1 r(j+i)*/T 



■k/T 



logx| £ 1 dx 



□ 



The next lemma corresponds to Giraitis, Hidalgo and Robinson's [14] 
Lemma 4.4, which we state without proof for easy reference. For this pur- 
pose, let Uj := / l ~ 1/2 (27rT)- 1 / 2 Y%=i X(t)e itx i , Vj := (2-kT)- 1 / 2 ^f =1 e{t)e itx i 
and Rxs{X) be the spectral coherency ([6], pages 256-257) between X and 
s. Also, herewith c will denote the conjugate of the complex number c. 

Lemma 2. Assuming Al and A2, £/ien, as T — > oo ; i/ie following rela- 
tions hold uniformly over 1 < j < k <T : 

EfaVj) = Rxej + OCT 1 log(i + 1)); 
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E(u j v j ) = 0(j- 1 log(j + l)y, 
max(\E(u k Vj)\,\E(u k Vj)\) = O^" 1 log(fe)); 
m a x(\E(v k u 3 )\,\E(v kUj )\) = Otr 1 log(fc)). 

The next lemma corresponds to the proof of expression (4.8) of [37], pages 
1648-1651, using the orders of magnitude of the terms ai, 02, b± and 62 in [37] 
and Lemma 3 there, but using our Lemma 2 instead of Robinson's [36] 
Theorems 1 and 2 when appropriate. 

Lemma 3. Let £ : [0, tt] — > MP satisfy the same conditions on <pQ in A3(a)-(c) . 
Then, assuming Al and A2, as T — ► 00, for 1 < r < s < T, h = 1, . . . ,p: 

2 



E 



< /^iog 2 (T) £ j- 1 iog(r) + ^(r 2 iog 2 (r) + r x k- 1/2 ) 



k=r 



Lemma 4. Let ( :[0,ir] 
and write 



4(A) 



and 



a j, (A) 



1 



satisfy the same conditions on (j)g in A3(a)-(c) 

[TX/n] 



T l/2 



E U 



-) 



1 



[TA/tt] 



E 



hi 2tt 



rV2 £ 

Then, under the conditions of Theorem 1, /or some < 5 < 1/6, 
(19) E sup \\a C T {\) -c4(A)|| = 0(T~ S ). 

Ae[0,7r] 

Proof. It suffices to show that (19) holds for each element of the vector 
a.j,(\) — aj,(X). Then, by the triangle inequality the left-hand side of (19) is 
bounded by 



E sup 



1 



[TA/71-] 



(20) 



Ae[o l7 r]TV2 ^ 



Uj — Vj 



+ 2E sup 

AG[0,7r] 



[TA/tt] 

TE E CjVjiuj-Vj) 



T \/2 
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The first term of (20) is bounded by 



( E <w> - £ ) + ( E N 2 - £ 



by Lemma 2, because E|uj| 2 = (2ir) 1 cr 2 , and by assumption \Q \ < K logT. 

Next, to show that the second term of (20) is 0(T~ S ), it suffices to show 
that 



(21) 



E max_ 

s=l,...,T 



0(T~ 5 ). 



By the triangle inequality the left-hand side of (21) is bounded by 



(22) E max_ 
*=i,...,p*»] 



1 s 

^— EcfS(%-^-; 

1 3=1 



+ E 



[T /3] 



T-l/2 ^ 



(23) 



+ E jnax 

s=[T0]+l,...,T 



E cj*SK--^ 



j = [T/3]+l 



where A < /5 < \ . Using the inequality 



sup|cp|) = sup|c p | 2 <^2\c p \ 2 , 



(24) 

by the Cauchy-Schwarz inequality the square of (22) is bounded by 

[TP] s 2 



E E 



Ecrv 



3=1 



0(f 2p - 1 \og A T) = 0(T- 



2<5\ 



using Lemma 3. 

To complete the proof, we need to show that (23) = 0(T~ 5 ). To that 
end, let q = 0, . . . , [T ? ] — 1 with ^ < <j < j3. By the triangle inequality (23) is 
bounded by 



E - jnax 

T 1 ' 2 s=[TP]+l,...,T 



[TP]+q(8)T/[T^ 

E E 

^ = [T/3]+l j=[T0]+l ■ 



^ k) Vj(Uj-Vj) 
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(25) 



+ E - jnax 

T 1 / 2 8 =[T0]+1,...,T 



[TP]+q{s)T/[T"\ 

E cjSte 

j=[T0]+l 



where q(s) denotes the value of q = 0, . . . , [T">] — 1 such that [T@] + q(s)T/[T <; ] 
is the largest integer smaller than or equal to s, and using the convention 
Ec = if d < c. 

By the definition of q(s) and the Cauchy-Schwarz inequality, the square 
of the second term of (25) is bounded by 



E— max 

T g =0,...,[T*]-l 



< 



T 



[T<]-1 

E E 

q=0 



[T^+qT/lT*] 

j=[T' 3 ]+l 
[T^]+qT/[T^] 

E cf^i^-^) 

J=pv»]+i 



by (24). But, using Lemma 3, we have that the right-hand side of the last 
displayed inequality is bounded by 



J <j=0 



+ 9 



1/2^1/2(1-?) 



< K log 4 T(T ?_/3 + T^ 1/2 ) < ifT~ 



25 



where |g|+ =max{l, \q\}. To complete the proof, we need to show that the 
first term in (25) is 0(T~ 5 ). To that end, we note that this term is bounded 
by 



E— — max max 
T 1 / 2 g =o,...,[^]-i s 



E Cj^VjiUj-Vj) 
j=l+[T0]+qT/[T^] 



where the max s runs for all values s = 1 + [T 13 ] + gT/[T f ], . . . , [T 13 ] + (q + 
1)T/[T">]. By the Cauchy-Schwarz inequality and (24), the square of the last 
displayed expression is bounded by 



i [ts]-i pr"]+(«+i)r/[r^] 

9=° s= l+[T/9]+qT/[T<;] 



E 



j=l+[TP]+qT/[T<] 



4 ^ [T^]-l [T0]+( 9 +l)T/[T<] 



<*^E 



E 

9-° s=l+[T/*]+gT/[T«] 



X T (l-?)/2 
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< K l ^L(f 1 -<logT + f 3 ^/ 2 ) < Kf^-^l 2 \og A T< KT~ 25 , 
where in the first inequality we have used Lemma 3 and that, for q > 1 and 

u /lT?]+(q+l)T/lT<] \ 

y — tL y i 

j=l+[TP]+qT/[T<:] V ' X j=l+[T0]+qT /[T<] ' 

This completes the proof. □ 

Remark 1. Lemma 4 holds for a^(A) and q^(A) replaced by 

ciy(A) := atj,(jr) — Oy(A), &tW := "r( 7r ) ~~ &tW> 
respectively. This is so because the triangle inequality implies that 

E sup \a C T (X) -fir(A)| < 2E sup |a^(A) - a^(A)|. 

Ae[0,7r] Ae[0,7r] 

Define, for fx and i9 G [0, 7r], 

2 [T*A] 

(26) c s (/x,tf) = ^~^ y CpCos(sAp), 

p=[T M /7r]+l 

where £ is as in Lemma 1 and 

Lemma 5. For <//< #i, #2 < tt ; as T— > 00, 
r-iT-t 

(27) £ £ c a (jA,0 i y s (ji,0 2 ) =3(^^i,^2)(l + o(l)), 

t=l s=l 



wftere 5(^,^1,^2) = vr" 1 J^ 1 ^ 2 CNC'W d« - (tt" 1 f* 1 ((u) du)^- 1 x 
Proof. A typical component of the matrix on the left-hand side of (27) 



is 



T 2 T 



[Ti?i/tt] [Ti?2/tt] T-lT-t 

E cg l} E c£ 2) EE cos (*V)cos( s a P2 ; 



pi=[T|0r]+l p 2 =[T M /7r]+l *- X 8 ~ 1 
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4 [TiV7r]A[Ttf 2 /7r] T-lT-t 



E W } EE«»H) 



(28) 



2 [Ttfi/V 



E 4*° 



pi=[7>/7r]+l 

[Ti9 2 /7r] T-lT-t 

x E CS 2) E E{ cos ( sA pi+ P2 ) + cos(sA pl _ P2 )}. 

p 2 = [T M /7r] + l i=1 5=1 

P2^Pl 

Because cos 2 A = (1 + cos(2A))/2, then using formulae in [6], page 13, we 
have that Y,J=i £2=1 cos 2 (sA p ) = (T — l) 2 /4 and, for p x / p 2 , 

T-lT-t 

E E{ cos ( s V+p 2 ) + cos(sA Pl _ P2 )} = — T 

t=l s=l 

and, hence, we conclude that the right-hand side of (28) is, recalling that 



T = [T/2], 



2^2 I j S P S P I 



2 [Ttfi/Tr] [t# 2 /t 



[Ti?i/7r]A[Ti? 2 /7r] 

E 

P=P>/tt]+1 

E 1 #° E cg 2) 



pi=p>/7r]+l p a =[T/i/7r]+l 

P2^Pl 

= ^ ta >(Mi,02)(l + o(l)), 

by Lemma 1 and where g^ kl ' k2 \n,'&i,'&2) denotes the (/ci,A;2)th element of 
the matrix g(n, $2)- d 

We now introduce the following notation. For < v\ < V2 < 7r, 

/ 1 PW*\I \ /™l/2 T \ 

(29) £ llT («i,«!i):=[- E C P )(— E^*)-* 2 ))' 

^ P=[T«i/tt]+1 / V t=l / 

T t-1 

(30) £2,1^1,^2) := ^e(t)^2e(s)c t -s(v 1 ,v 2 ), 

t=2 8=1 

where q(-, •) is given in (26) and £ is as in Lemma 1. 
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Lemma 6. Let < v\ < v < V2 < 7r. Then assuming Al, for k = 1, . . . ,p 
and for some (3 > and < S < 1, 

(31) E(\£^( Vl ,v)f\£^(v,v 2 )f) < K(v 2 - Vl ) 2 ~ s , j = 1, 2, 

where s[ k ^(vi,v) and £^{v\,v) are the kth components of (29) and (30), 
respectively. 



Proof. We begin with j ' = 1. By Lemma 1, 

E cf- 1 / c (fc) w^ 



T 

p=[Tui/7r]+l 



after we notice that we can take T 1 < (v2 — v±), since otherwise (31) holds 
trivially. On the other hand, Al implies that E(J2f =1 (e 2 (t) - a 2 )) 2 < KT. So, 
using the inequality (v2 — v)(v — v±) < (v 2 — v\) 2 and the Cauchy-Schwarz 
inequality, we have that E(\£[ k j,(vi,v)\\s[ k ^(v,V2)\) <K(v% — v\) 2 " 5 . 

To complete the proof, it suffices to examine that the inequality in (31) 
holds for j = 2. Now 

4 

E(£$ (vi, v 2 )f = 16 <$?-i {vuv 2 )n<ti)<si) . . . e(U)e(s 4 )). 

j=l l< S j<tj<T 

Since the number of equal indices in the set {£i,si, . . . ,£4,54} does not ex- 
ceed 4, by assumption Al it follows that |E(e(£x)e(si) . . . e(t4)e(si))\ < K. 
Moreover, by Al the inequality |E(e(ti)e(si) . . . £{ti)£{s^))\ 7^ can hold only 
if any tj,Sj are repeated in {£1, si, • ■ • ,£4, S4} at least twice. Hence, by the 
Cauchy-Schwarz inequality, we obtain that 



1/2 

( *L>i,^)) 2 



E(£^( Vl ,v 2 )) A <Kl[( £ (c[ 

j=l \l<Sj<tj<T 

= K( Y (c [ t k J s (vi,V2)) 2 ) . 
\l<s<t<T ) 



But by Lemma 5 the right-hand side of the last displayed equation is bounded 
by 

K [-J (( {k) (u)) 2 du- (- j C {h) (u)duj j KKfa-Vi) 2 - 6 

because | f^ (^ k \x)) p dx\ < K\v 2 — ^i 1 1 ~ <s / 2 for p = 1, 2. This concludes the 
proof choosing [3 = 2 by the Cauchy-Schwarz inequality. □ 
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Lemma 7. Denote rj p := I EjP — a /(2ir) and 

2ir [¥vM 2vr f 

R r( v ) = jZp E (pVp and R t( v ) = 7^ E CpVp, 

P =1 p=[Tv/w] + l 

(0<v<tt) 

with ( as in Lemma 1. Let < v\ < v < v 2 < vr. TTiera assuming Al, for 
some (3 > and < 5 < 1 : 

( a ) E(ii4( V2 ) - 4(tof n4w - 4(^i)f ) < * - ^i) 2 " 5 , 

(32) 

i = i,2. 

(b) 4(«) 4^(0, 4ttV^)), j = 1,2, 

where (v) = a 4 Jq ((u)(,'(u) du/ir + o~ 4 k Jq ((u) du Jq ('(u) du/tr 2 and 
V®(v) = a 4 f* C(u)C(u)du/TT + a 4 Kj^ £(u)duf* C'{u)du/n 2 , with k denot- 
ing the fourth cumulant of {e(t) / 'a}t^z- 

Proof. We begin with (a). We shall consider Rjp(v) only, R^(v) being 
similarly handled. Prom the definition of r] p , and 

27r [Tvi/ir] 

4W-4W = ^ E C P v P , 

p=[Tv/n]+l 

we have that 

Rt(v) - Rt{v 2 ) =£i,t(v,v 2 ) +£2,t(v,v 2 ), 

where £\,t{v,V2) and £2,t(v,v%) are given in (29) and (30), respectively. 
Now (32) follows immediately from Lemma 6 and standard inequalities. 

Part (b). We will examine R^(v) ^M(Q,A-k 2 V^ 1 \v)), the proof for j = 2 
being handled identically. But this follows by an obvious extension of The- 
orem 4.2 of [14] because C( u ) satisfies the same conditions on h n (u) there. 
□ 

Lemma 8. Assume A1-A4. Then we have that, for some < 5 < 1/6, 
2 n [ TA H ( I ■ a 2 \ 2tt f TA ^ / a 2 \ 

(a) W 2 § Cj " ) = W 2 § Cj V £J " 27 j 

(33) 



9 A/TTj \ 

fr E G^Jr 1 ^ 
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(b) 



— V C 

j = [TA/7T]+l 



j=[TA/7r]+l 



T l/2 



E ca.> 1/2 (^-^o)+Op(^), 



j=[TA/tt]+1 

where the O p (l/T s ) terms are uniform in A G [0,7r], and where ((u) and 
\\C(u)\\ are as in Lemma 1. 

Proof. We examine (a), part (b) being handled similarly. The difference 
between the left-hand side of (33) and the first term on its right-hand side 
is 



(34) 



TV2 £^ 



3=1 



+ 



T l/2 s 3 
J 3=1 



'*,3 



do, j 



9^ [ TA /H T 



0) 



First we notice that 

(35) T -^o = O P (T- 1/2 ), 

which follows by (8) in assumption A4, and because 

T 



(36) 



1 



E 



(recall that under H , hj = hg 0i j), by Lemma 4 and Markov's inequality, and 

T 



Ix,k 



I £>k \=O p {T-°) 



2tt 



(37) 



vr Jo 



u)B n (du) 
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by Lemma 7 with ((u) = cf>g (u). Notice also that J2k=i <M),fc = 0(\ogT) by 
Lemma 1 because (9) and A3 part (c) implies that <j)Q (A) satisfies the same 
conditions on £(A) in Lemma 1. 

Next, A3 part (d) implies that, uniformly in A G [0,7r], the norm of the 
first term of (34) is bounded by 



[TA/tt 



(38) 



Kf^\\e T -e fi £ liog^Hioil^ = o p (T-n 



J'=l 



he ,j 



because (35) implies that we can take 5 = KT~ 1 / 2 in A3 part (d) so that 
A7" 5 < K when 5 < KT~ 1 / 2 and j > 1, and also because by Markov's in- 
equality and Lemmas 4 and 7, 



sup 

Ae[0,7r] 



[T\/*] 



0o,j 



a 
2^ 



O p {T 



-1/21 



and because by Lemma 1 with ||£(n)||| log 2 (n)| there, 



sup 

Ae[0,7r] 



? El^A J |||C,||--/ o |lo g 



(u)\\\((u)\\du 



o(T 



-1/2^ 



The second term of (34) is O p (T s ) by Lemma 4 and Markov's inequality. 
Next, proceeding similarly as in (38), since C(A)^ o (A) satisfies the same con- 
ditions as C(A)| log A|, the third term of (34) is f ~V 2 jjj=t ] Cj(>e ,jT l/2 {B T - 
6q) + O p (T~ s ), which concludes the proof. □ 

Lemma 9. Assuming Al, for any < v < (1 — <5) /4, with 5 as in Lemma 7, 
we have that, for all k = 1, . . . ,p, 



(39) (a) E 



(40) (b) E 



(vr-Ai)« (tt-A 2 )< 

ggj(Al,7T) gff(A 2 ,7T )- 

(tt-Ai) w (vr-A 2 ) 1 



<^(A 2 -Ai 



i2-5-2u 



<if(A 2 -Ai) 



2-S-4v 



for all < Ai < A 2 < vr, and where j*(Ai, A 2 ) and ^ y(Ai, A 2 ) are given in 
(29) and (30), respectively. 



Proof. We begin with (b). By standard inequalities the left-hand side 
of (40) is bounded by 



ATE 



£fJ(Ai,A 2 )) +K 



7T - Al)- 



1 



1 



(tt-Ai) u (vr-A 2 ) J 



E(^(A 2 ,vr)) 4 . 
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By Lemma 6, for any < 5 < 1, we have that the last displayed expression 
is bounded by 

(>-2-\l) 2 - S . ,,l 1 1 \* , ,2-5 



Consider the case A 2 — Ai < 2~ 1 (-7r — A2) first. By the mean value theorem 
(41) is 

K ^-Xx) 2 - S 



+ 



(vr-AO 4 - 

K v A (\ 2 - \\Y 



(vr - Ai) 4 «(vr - X 2 ) s +^~ 2 (J3(n - Ai) + (1 - /3)(tt - A 2 )) 4 " 4 « 
< K{\ 2 - Ai) 2 " 5 " 4 " + K(tt - A 2 )- <5 - 4 "- 2 (A 2 - Ax) 4 , 

where f3 = f3(\\, A 2 ) G (0, 1), and then because tt — Ai > A 2 — Ai and tt — Ai > 
7T — A 2 > 0. But the right-hand side of the last displayed inequality is bounded 
by K(\ 2 - Ai) 2 "- 5 " 4 " since A 2 - Ai < 2~ 1 (ir - A 2 ). 

Next, consider the case for which 2~ 1 (ir — A 2 ) < A 2 — Ai. Using the in- 
equality — < (a — &)■> for any < q < 1 and a > b, we have that (41) is 
bounded by 



n,2-5-4« , ^ (A 2 ~Ai) (vr-A 2 ) . -,2-S-Av 

(tt - Ai) 4 "(7r - X 2 p v 



where we have used < A 2 — Ai < tt — Ai and tt — A 2 < 2(A 2 — Ai). This 
completes the proof of part (b). 

Next part (a). By definition and Al, the left-hand side of (39) is bounded 

by 

[TA2/71-] 



* (I y c (fc) i 

V j=[TXiM+l 7 



+ ^l(7r-A 1 )- (7r-A 2 )«J (f . r X , Cj 



\ 2 M ^ .(*) 



2 



j=[TA 2 /7r]+l 



n2-<5-2»j 



<if(A 2 -A ly 
by Lemma 1, and then proceed as in part (b). □ 

In what follows we shall abbreviate j'g Ag}p(q) by Hg t j<(q). 
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Lemma 10. Assuming A1-A5, for all e > 0, 



lim limsupPr< sup 

Ao-»7T T^oo l,Ao<A<7r 

(42) 



[TX/n] 

E 



He ,T(k) 



T — T l l 2 

fc=[TA /7r]+l 



T 

X J2 700,. 



j=k+l 



h$ T ,j 2ir 



> e 



0. 



Proof. Abbreviate h g ^ jlx,j — Ie,j by and take Ao > vr/2 without 
loss of generality. Noting that Kq~ t jlx,j — cj2 /(27r) = Xj + r/j, where rjj = 
I £ j — a 2 /(27r), we have 



sup 

Ao<A<tt 



[TX/n 



l ^ He T (k) s—^ 



t j-^ 1 r 1 / 2 

fc=[TA /7r]+l j=fc+l 



T 



(43) 



fc=[TA /7r]+l 



-if 



sup 

jrA /7r]<fc<T 



[1-k/T)- 5 / 2 * 
j=k+i 



T l/2 



+ sup 

[TA /tt]<A;<T 



{i-k/Ty 5 / 2 * 

7frT2 ^ 7(? °'^ 

J j=fc+l 



for any < 5 < 1. The first factor on the right-hand side of (43) is bounded 
by 



K 



rp ^ ^ 

fe=[TA /7r]+l 



l79 ,fcll(l-|) 



using 



< K 



T - [TAq/tt] 



T 



5/2 



because ||^4g (A)|| > AT~ 1 (7r — A) by assumption A5 and because Lemma 1 
implies that $up [TXoM < k < T \\Ae 0>T (k) - A 0Q ([kTv/f])\\ = 0(r -1 log 2 T). 

Next, by Lemma 9 the second term inside the braces on the right-hand 
side of (43) is O p (l) for 5 > small enough, whereas Lemma 8 and (35) 
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imply that the first term is bounded by 



sup 

[T\ /Tr]<k<T 



~ 1^ ^o,iK,i 



+ o z 



sup 

[TX /ir]<k<T 



j=k+l 

;i - k/fy 5 / 2 



o P (i) 



y<5 



Opdyr-Aol 5 / 2 ), 



because of T _1 < T _1 < '^i^XoM<k<T0- — k/T), < S < 1, and an obvious 
extension of Lemma 1 but with ("(A) = -ygip (A)0^ o (A) there. So, (43) is O p (\ir — 
Xq\ S ), which implies that (42) holds because 5 > 0. □ 



Lemma 11. Assuming A1-A6, 
f 

' J «tJ ~ 960, j j 



(44) sup 
Ae[o,7r 



1 



T l/2 



E 



ffrj <x 2 A 
\he T ,j 27r/ 



/logT 



j=[TX/ir}+l 

Proof. The expression inside the norm on the left-hand side of (44) is 



T l/2 



E 



(45) 



j = [TX/7T] + l 
1 



+ 



+ 



T l/2 
1 

f 1/2 



Mod 

f 

E 



6<i,3 



L e,3 



j= [T\/n}+l 
f 



V0i3 \ led 



<J- 

2^ 



h - 6o) 



E (<t>6 T ,j-4>d Q ,j-</>do,j(0T-0o)) 



j=[TX/n]+l 



he T ,j 2tt 



By A6 and then noting that \a — b\ < (a — b) + 2b for a > and b > 0, the 
norm of the third term of (45) is bounded by 



K 



\\Br 



|2 T 



T l/2 



E I lo g(^- 



3=1 



Ix, 



he 



T,J 



a 
2^ 



< K 



\\0t 



12 r T 



O r 



T l/2 

logT 

T l/2 



1x4 ^ 



l 6 T ,j 



,2 T 



^)+-Eii°gA, 



7r r 1 
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by (35) and then using Lemmas 8 and 7 with C(A) = | log A | , and Lemma 1, 
respectively. So, uniformly in A, the third term of (45) is o p (l). Likewise, 
the first term of (45) is O p (T -1 / 2 ) uniformly in A using Lemma 8 with 
C(A) = 4>e Q W and (35). Observe that <^e (A) satisfies the same conditions as 
C(A) in Lemma 8 by A6. Finally, the second term of (45) is O p (T~ 1 / 2 ) by 
Lemma 7 with £(A) = 4>g (\). □ 

Lemma 12. Assuming A1-A6, for all e > 0, 



lim limsupPr< sup 

A -»7T T^oo (Ao<A<7t 

(46) 



1 



[T\/tt] 

E 



H e T ,T(q) 



<?=[TA /7r]+l 



X E Tftro 
j=q+l 



0T,j 



a 
2^ 



0. 



Proof. Notice that (35) implies that it suffices to show (46) in the set 
{\\0t ~ 0a\\ < KT' 1 / 2 ™^ 1 }, where mT + m^T" 1 / 2 -> 0. On the other hand, 
Lemma 11 and then Lemma 8 imply that, uniformly in q, 



T l/2 



(47) 



E 

3=9+1 

'? 



1 j=g+i 



\T^(e -e T ) + o p (T- & ), 



i i 



1 j=q+l 



Tl/2 - 
J i=9+l 



proceeding as in the proof of (44) but with Xj + ?/j replaced by r/j there. 
Observe that we can take Ao > it/2. Next, uniformly in q, A6 implies that 

sup _\\Ae T>T (q) ~ Ag 0>T (q)\\ = (vr - X )O p (\\9 T - 6 \\), 

[T\oM<q<T 

which will imply that, with probability approaching one, as T — > oo, 
\A7'(q)\\ < \\A-Uq)\\(l + KT-^m- 1 ) < K (l-V~ 



because ||^4e (A)|| > K — A) and Lemma 1 implies that 

sup _\\A e(h T(q)-A do ([q7T/f})\\ =0(T- 1 log 2 T). 

[T\oH<q<T 

So, we have that, for < 5 < 1/2, 



sup 

A <A<7T 



1 



[TA/tt] 

E 



6 T ,T 



(<?) 



T _ *~ ' T 1 / 2 

?=[TA /7r]+l 



T 

E ^ T ,j 

j=q+l 



r 2 
2vr/ 
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[5Vtt] „ x -1+5/2 



< K sup 

A <A<7T 



(48) 



g=[TA /V]+l 



sup 

.[TA /7r]<g<T 



-6/2 1 T 

+O P (|vr-A | 5/2 )|, 



by (47) and because T 1 < T 1 < infr= Ao / 7r i< g <r(l ~~ l/^)- But Lemma 9 
implies that 



sup 

[T\ /Tr]<q<T 

and A3 implies that 



T 



j=q+l 



o P (i), 



sup 



A <A<7T T = 

<j=[TA /7r]+l 



E Il70o,?ll 1 



' 1+</2 <g f r-|TAo/7r] \' 
T 



and, hence, the left-hand side of (48) is O p (\tt — Ao| 5 ^ 2 )- From here we con- 
clude that (46) holds because 5 > 0. □ 

7. Proofs. This section provides the proofs of the main results which are 
based on the series of lemmas given in the previous section. 

PROOF of Theorem 1. Part (a) follows by Lemma 4 with £(A) = 1 
there. The proof of part (b) follows immediately from part (a) and Lemma 7 
with C(A) = 1 there. □ 

Proof of Theorem 2. Part (a). By Lemma 8 with ("(A) = 1 there and 
the definitions of Gg^ an d Gj>, we have that 

[TA/7T 



(49) 



/ 2 [1 A/ 7r J v _ 

[j E <t>'e , j )T 1/2 (e T -e Q ) + Op (i) 



% E ^ojI^t 1 



[TA/t, 



2tt 



i=i 



G eo , T (7T)Tl/2 ^ 



E foo.k 



Ix,k 



+ o P (l), 
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by (8) and (9), and where the o p (l) term is uniform in A G [0,ir]. Likewise, 

(50) f 1 / 2 (G eTiT (7r)-G^(7r)) = 0p (l) 

because of (36) and (37) and, by Lemma 1 with ("(A) = 4>q (\) and (9), 

we have Wf' 1 J2j=i &e ,j\\ =0{T~ l \ogT). So, (50) holds. Also, it is worth 
noticing that Lemma 1 with £(A) = cpg (X)4>'g o (A) implies that \\St — ^d \\ = 

oiT- 1 log 2 r). 

On the other hand, noting that (50) and Al imply that 

(51) G° t (tt) = <j 2 + O p (T- 1 / 2 ), 
and that 

l^„,TW-G°TWI = «p(r 1/2 ) 

by Lemma 4, then by (49), (50) and (36), uniformly in A, we obtain that 
ae T ,T(A) = a T (X) H G^Jtt) 

(52) +G ^ fl, ia^-G^) 



[TX/tv] [~ 



27T 



i=1 L ^ H 



+ Op(l), 



which concludes the proof of part (a). 

Next part (b). Taking into account part (a), part (b) follows because 
Lemma 7 guarantees the fidi's convergence of otj. and its tightness. Tight- 
ness of the second term on the right-hand side of (52) follows by (37) and 
Lemma 1 and then because Jq 4>g (u) du is Holder's continuous of order 
greater than 1/2 by A3. This concludes the proof of the theorem. □ 

PROOF of Theorem 3. Using (51) and recalling that H 6iT (j) = 7^ jAj\,(j), 
we obtain that 

(53) 

1 f 27T 

-H 9o>T (j)-~ 70o,fc(-2 J £ ,fc- l) ] +Op(l)> 

1 k=j+l v 



where the o p (l) term is uniform in A 6 [0, tt]. 
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Suppose, to be shown later, that the convergence in [0, Ao] holds for any 
< Ao < 7r. Then, because B w and the limit of the process T -1 / 2 J^}j=/ n ^ ~ 
a 2 /2n) are continuous in [0, 7r], Billingsley's [3] Theorem 4.2 implies that it 
suffices to show that, for all e > 0, 



lim lim sup Pr < sup 



X <X<W 



[TX/tt] 

E 



He , T (j) 



j=[T\ /n]+l 



x E 7e ,fe( -T J e,fe-l 
fc=i+i 



which follows by Lemma 10; compare the second term on the right-hand 
side of (43). 

So, to complete the proof, we need to show that, for any < Aq < vr, 



1 



T l/2 



(54) 



[TX/n] 

E 



^1 -1 



fe=i+l 



2vr 



7T 



-^■^(A), 



in [0, Ao] - Fidi's convergence follows by Lemma 7, part (b) after we note that 
the second term on the right-hand side of (53) is 



1 T / 1 fcA[TA/7r] \ 

7^J2[f E i H 9 ,T(j)jle ,k 



2tt 



Ic,k 



and (j>-i ^k^pTA/jr] Hg 0i T(j))^/e ,k satisfies the same conditions of Lemma 7 
for C(A), for example, those of h n (X) in [14], Theorem 4.2. Then, it suffices 
to prove tightness. Since is tight, we only need to show the tightness 
condition of 



(55) 



1 



[TX/tt] 



A HA)=~ E Ho ,tU)\ 



1 



T 



3=1 



T l/2 



k=j+l 



By Billingsley's [3] Theorem 15.6, it suffices to show that 

E(|A r (0) - A r (/i)||A T (A) - A r (0)|) < K|A - //| 25 

for allO</i< - i?<A<7r and some <5 > 1/2. Observe that we can take T _1 < 
| A — fi\ , since otherwise the last inequality is trivial. Because (A — 1?) (i? — /x) < 
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(A — /i) 2 , by the Cauchy-Schwarz inequality, it suffices to show the last 
displayed inequality holds for E|Ay(A) — Ax(fJ,)\ 2 , which is 



i E h 60)T u) E E 



[TX/n] 



^3 ^ w ^ ^ l6o,hle ,£ 2 

j,k=[Tn/n] + l t x =j+\i^=k+\ 



X E 



cr 



2tt 



<5 £ ||^ 0)T (i)||||^ , T (fc)|| 

J,fc=[T/ i /7T] + l 



[TA/tt] 



<K{\H{\)-H^)\ 2 + T- 2 \og 2 T), 



where 



/■A _ 
H{\):=n~ l H 0o (x)dx and ||fT T (A) - #(A)|| < ivTT" 1 log T 



and 



[TA/rr] 

H T (\):=T~ l \\Ho ,T 

3=1 

by Lemma 1. From here we conclude by Billingsley's [3] Theorem 15.6, be- 
cause H(\) is a monotonic, continuous and nondecreasing function such that 
\H(X) - H(n)\ < K\\ - n\ 5 , 5>l/2 and f- 1 < |A - n\. □ 

Proof of Theorem 4. By definition of f3g iT and it suffices to show 
that 



(56) 
and 



1 



T i/2 



Ix,k 



[TX/tt] 



1 



4,fe) -H 6o>T (k)~ E 7*bJ 



j=fc+l 



n 8 T ,j 



(57) 



1 



Go t ,t(k 

[TA/tt 
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converge to zero uniformly in A € [0,7r]. Expression (56) is o p (l), uniformly 
in A £ [0, 7r] , because the contribution due to the term in brackets in the last 

line of (52), that is, -^^(G^ir))- 1 S^f-V^LiK^k, is easily 
seen to be zero. Next, because 

~ Y he ,k\\\\ A eo,T( k )\\ ~ Y H^ojll 

1 k=l j=k+l 

1 [TXM ( k 

1 k=i v 1 

[TX/tt] 
~ E ho ,k\\< K 



T 



k=l 



by integrability of 7g and ||j4g — k/T)~ 1 \\ > by A3 and A5, it 

implies that the contribution to (56) due to the term o p (l) on the right- 
hand side of (52) is negligible. 

Next we examine (57). Because of (50) and (51), it suffices to show that 



(58) 



Ho QjT {k) 

T l/2 



Y ^o„ 



j=k+i 



hg T j 2tt 



He T , T (k) 



T 



Y T«TJ 



1 j=k+l 



2tt 



he 



T.J 



converges to zero uniformly in A £ [0, it], after observing that 

T [TXM t 



sup 

Ae[0,7r] 



[TXM T [TXM 

Y H e T A k ) Y ^e T ,j- Y H 8o,r( k ) Y T»oJ 

k=l j=k+l k=l j=k+l 



0. 



First, we observe that Lemmas 10 and 12 imply that it suffices to show 
the uniform convergence in A £ [0, Aq] for any Aq < it. But (58) is equal to 



1 [TXM] 1 T 

(59) ~ Y Y irf6 ,j-le T j) 

1 k=\ 1 j=k+l 



he T j 27T/ 



j [TXM] i T 

( 60 ) +^ Y (%(*)-%(*))^ Y 7flbJ 

1 k=l 1 j=k+l 



h$ T ,j 27r 



So, the theorem follows if (59) and (60) are o p (l) uniformly in A £ [0, Aq]. 
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(61) 
(62) 
(63) 



1 



sup ~ E 

Ae[o,7r]T j=1 



»e T ,j\ 



Op(!)> 



sup \\Aj o ] T (X)-Aj o 1 (X)\\=o(l), 

AG[0,A ] 

sup \\A£ t M ~ Ag ] T (X)\\ = Op(l). 

AG[0,A ] 



(61) follows proceeding as with the proof of (44) in Lemma 11, but without 
the factor hg^ jlx,j — f 2 /(2vr); (62) follows because assumption A5 implies 
that Aq (\q) > and because, by assumption A3, ||00 o (A)<^ o (A)|| satisfies 
the same conditions on C(A) in Lemma 1, so that 



sup ||^ (A)-4, 0iT (A)|| =0(T- i log^T); 

AG[0,A o ] 

and (63) follows proceeding as with the proof of (61) and (62). 

Now we show that (59) is o p (l) uniformly in AG [0, Ao], which follows by 
Lemma 11 and (61)-(63), noting that (7^ - 7^.) = (0,4>' eo j - <t>'e T j)\ so 
does (60) by (61) and (63) and noting that 



sup 

Ae[0,7r] 



1 



T 



rpl/2 



E 



o P (i) 



j=[TA/ir]+l 

by Lemmas 7 and 8 with £(A) =7e (A) there and observing (35) and that 
by Lemma 1, f~ l Ej =[ y A/7r]+1 iBoj^j ~> S\ 7e (xWo ( x ) dx - u 

Proof of Theorem 5. Under Hit, we have that, by definition, 



9 ^ PTA/ir] T o pTA/ir] 



2-KT \ - , 



T 3/2 



hj 2ir 



3 =i j 



By Lemmas 1, 4 and 7 with ("(A) = r/(A), and because |sy | is integrable, we 
have 

2tt [¥xM <j 2 t r x 

Go ,tW = ^ E ^ + wjt *(«)** + <*(r~ 1/2 )- 
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So, using (51) because Jq l(u) du = 0, we have that, uniformly in A 6 [0, it], 

+ Op(l) 

= (A) + - / /(n) du + oJl). 
vr Jo 

From here the conclusion is straightforward. □ 
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